PFIMII: Parallel Frequent Itemset Mining using Interval Intersection

نویسندگان

  • Neelam Duhan
  • Parul Tomar
  • Amit Siwach
  • Jiawei Han
  • Micheline kamber
  • Siddharth Shah
  • N. C. chauhan
  • S. D. Bhanderi
  • H. Li
  • Yi Wang
  • Vania Utami
  • Ashok Savasere
  • Edward Omiecinski
  • Shamkant Navathe
چکیده

Data Mining techniques are helpful to uncover the hidden predictive patterns from large masses of data. Frequent item set mining also called Market Basket Analysis is one the most famous and widely used data mining technique for finding most recurrent itemsets in large sized transactional databases. Many methods are devised by researchers in this field to carry out this task, some of these are Apriori, Partitioning approach and Interval Intersection etc. In this paper, a new approach is being proposed to find the frequent item sets using Interval Intersection and Apriori Algorithm, which produces results in parallel on several partitions of dataset. For representing the item sets, interval sets are used and for calculating the support count, interval intersection operation is used. The experimental results indicate that the proposed approach is accurate and produces results faster than Apriori Algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating Frequent Patterns Through Intersection Between Transactions

the problem of frequent itemset mining is considered in this paper. One new technique proposed to generate frequent patterns in large databases without time-consuming candidate generation. This technique is based on focusing on transaction instead of concentrating on itemset. This algorithm based on take intersection between one transaction and others transaction and the maximum shared items be...

متن کامل

An Algorithm for Mining Maximum Frequent Itemsets Using Data-sets Condensing and Intersection Pruning

Discovering maximal frequent itemset is a key issue in data mining; the Apriori-like algorithms use candidate itemsets generating/testing method, but this approach is highly time-consuming. To look for an algorithm that can avoid the generating of vast volume of candidate itemsets, nor the generating of frequent pattern tree, DCIP algorithm uses data-set condensing and intersection pruning to f...

متن کامل

Accelerating Parallel Frequent Itemset Mining on Graphics Processors with Sorting

Frequent Itemset Mining (FIM) is one of the most investigated fields of data mining. The goal of Frequent Itemset Mining (FIM) is to find the most frequently-occurring subsets from the transactions within a database. Many methods have been proposed to solve this problem, and the Apriori algorithm is one of the best known methods for frequent Itemset mining (FIM) in a transactional database. In ...

متن کامل

Optimization Of Intersecting Algorithm For Transactions Of Closed Frequent Item Sets In Data Mining

Data mining is the computer-assisted process of information analysis. Mining frequent itemsets is a fundamental task in data mining. Unfortunately the number of frequent itemsets describing the data is often too large to comprehend. This problem has been attacked by condensed representations of frequent itemsets that are sub collections of frequent itemsets containing only the frequent itemsets...

متن کامل

Weighted Itemset Mining from Bigdata using Hadoop

Data items have been extracted using an empirical data mining technique called frequent itemset mining. In majority of theapplication contexts items are enriched with weights. Pushing an item weights into the itemset extraction process, i.e., mining weighted itemsets rather than traditional itemsets, is an appealing research direction. Although many efficient weighteditemset mining algorithms a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016